Experiments in Linear Template Combination using Genetic Algorithms
نویسندگان
چکیده
Natural Language Generation systems typically have two parts strategic (" what to say ") and tactical (" how to say "). We present our experiments in building an unsupervised corpusdriven template based tactical NLG system. We consider templates as a sequence of words containing gaps. Our idea is based on the observation that templates are grammatical locally (within their textual span). We posit the construction of a sentence as a highly restricted sequence of such templates. This work is an attempt to explore the resulting search space using Genetic Algorithms to arrive at acceptable solutions. We present a baseline implementation of this approach which outputs gapped text. NLG is the task of generating natural language from nonlinguistic inputs. Most NLG systems can be classified into two broad camps template based and statistical. Template based systems are generally characterized by structure gapped text (slotfiller structure) which is predominantly manually created (Reiter 1995) which generally result in high quality text but also limited linguistic coverage. Statistical systems such as (Langkilde 1998) on the other hand, use datadriven algorithms for text generation and have littletono reliance on repetitive manual resources making them more adaptable and maintainable, albeit with lesser text quality (Reiter 1995). In this work, we consider generation on a sentence level and output gapped text. We define templates as gapped text which can be filled to generate textual output. A (partial) sentence is a linear sequence of such templates. Since there are a large number of choices (templates) at every step of sentence generation (sequence of templates), it naturally gives rise to a search space which contains all such sequences grammatical and ungrammatical. The challenge is to navigate this large search space and arrive at reasonable grammatical sentences. RELEVANT WORK Template based systems have been considered to have a very shallow linguistic representation and mapping of the nonlinguistic data to the generated text. General consensus among the NLG community has been that template based systems are not as flexible, maintainable and expressive (linguistic coverage) as full fledged NLG systems (Reiter 1995). (Reiter 1997) mentions that template based systems and NLG systems are " turing 1
منابع مشابه
Modeling and scheduling no-idle hybrid flow shop problems
Although several papers have studied no-idle scheduling problems, they all focus on flow shops, assuming one processor at each working stage. But, companies commonly extend to hybrid flow shops by duplicating machines in parallel in stages. This paper considers the problem of scheduling no-idle hybrid flow shops. A mixed integer linear programming model is first developed to mathematically form...
متن کاملStatic Task Allocation in Distributed Systems Using Parallel Genetic Algorithm
Over the past two decades, PC speeds have increased from a few instructions per second to several million instructions per second. The tremendous speed of today's networks as well as the increasing need for high-performance systems has made researchers interested in parallel and distributed computing. The rapid growth of distributed systems has led to a variety of problems. Task allocation is a...
متن کاملGreen Space Suitability Analysis Using Evolutionary Algorithm and Weighted Linear Combination (WLC) Method
With current new urban developments, no balance can be found between green spaces and open areas present within urban networks and natural land patterns since urban networks are dominating ecological networks. Accordingly, one of the major tasks of urban and regional planners is the optimal land use allocation to urban green spaces. Therefore, to achieve this goal in this research, locations of...
متن کاملراهکار ترکیبی نوین جهت تشخیص نفوذ در شبکههای کامپیوتری با استفاده از الگوریتم-های هوش محاسباتی
In this paper, a novel hybrid method is proposed for intrusion detection in computer networks using combination of misuse-based and anomaly-based detection models with the aim of performance improvement. In the proposed hybrid approach, a set of algorithms and models is employed. The selection of input features is performed using shuffled frog-leaping (SFL) algorithm. The misuse detection modul...
متن کاملOptimization of Array Factor in Linear Arrays Using Modified Genetic Algorithm
The array factor (sidelobe level, SLL) of a linear array is optimized using modified continuous genetic algorithms in this work. The amplitudes and phases of the currents as well as the separation of the antennas are all taken as variables to be controlled. The results of the design using modified GA versions are compared with other methods. Two design problems were studied using several contin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1605.07366 شماره
صفحات -
تاریخ انتشار 2016